A large-scale prediction of protein-protein interactions based on random forest and matrix of sequence
Authors
Abstract
Protein-protein interactions (PPIs) are an important part of many life activities in organisms, and the prediction of protein-protein interactions is closely related to protein function, disease occurrence, and disease treatment. In order to optimize the performance of predicting these interactions, an RT-MOS model was constructed based on Random Forest (RF) and Matrix of Sequence (MOS) to predict protein-protein interactions. Firstly, MOS is used to encode protein sequences into a 29-dimensional feature vector; then, a random forest is built, optimized, and evaluated using a test set; finally, the model is used for prediction. The experimental results show that the accuracy rates on the benchmark dataset and the non-redundant dataset are 97.18% and 91.34%, respectively, and that the accuracies on four external datasets (C. elegans, Drosophila, E. coli, and H. sapiens) are 96.21%, 97.86%, 97.54%, and 97.75%, respectively. Compared with existing methods, the model is found to be superior. It has the advantages of saving time, preventing overfitting, and high accuracy, and is suitable for large-scale PPI prediction.
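The workflow outlined in the abstract (fixed-length feature vectors, a random forest, evaluation on a held-out test set) can be illustrated with a minimal, dependency-free sketch. Everything here is an assumption made for illustration: the abstract does not specify the MOS encoding, so the 29-dimensional vectors below are synthetic stand-ins, and the small forest (`build_tree`, `fit_forest`, `predict_forest`, `make_synthetic`) is a hypothetical toy, not the authors' RT-MOS implementation or its hyperparameters.

```python
import random
from collections import Counter

N_FEATURES = 29  # dimension quoted in the abstract; the values used here are synthetic


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def build_tree(X, y, rng, depth=0, max_depth=4):
    """Grow one tree. A leaf is a label; an inner node is (feature, threshold, left, right)."""
    majority = Counter(y).most_common(1)[0][0]
    if depth == max_depth or len(set(y)) == 1:
        return majority
    # Random forests consider only a random subset of features at each split.
    feats = rng.sample(range(len(X[0])), max(1, int(len(X[0]) ** 0.5)))
    best = None
    for f in feats:
        for t in sorted({row[f] for row in X}):
            left = [i for i, row in enumerate(X) if row[f] <= t]
            right = [i for i, row in enumerate(X) if row[f] > t]
            if not left or not right:
                continue
            # Weighted Gini impurity of the two children; lower is better.
            score = (len(left) * gini([y[i] for i in left])
                     + len(right) * gini([y[i] for i in right])) / len(y)
            if best is None or score < best[0]:
                best = (score, f, t, left, right)
    if best is None:
        return majority
    _, f, t, left, right = best
    return (f, t,
            build_tree([X[i] for i in left], [y[i] for i in left], rng, depth + 1, max_depth),
            build_tree([X[i] for i in right], [y[i] for i in right], rng, depth + 1, max_depth))


def predict_tree(node, x):
    """Walk from the root to a leaf and return the leaf label."""
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if x[f] <= t else right
    return node


def fit_forest(X, y, n_trees=15, seed=0):
    """Train each tree on a bootstrap resample of the training set."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(X)) for _ in range(len(X))]
        forest.append(build_tree([X[i] for i in idx], [y[i] for i in idx], rng))
    return forest


def predict_forest(forest, x):
    """Majority vote over all trees."""
    votes = Counter(predict_tree(tree, x) for tree in forest)
    return votes.most_common(1)[0][0]


def make_synthetic(n, seed):
    """Synthetic stand-in for encoded protein pairs: label 1 = interacting, 0 = not."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        X.append([rng.gauss(2.0 * label, 0.3) for _ in range(N_FEATURES)])
        y.append(label)
    return X, y


X_train, y_train = make_synthetic(120, seed=1)
X_test, y_test = make_synthetic(40, seed=2)
forest = fit_forest(X_train, y_train)
predictions = [predict_forest(forest, x) for x in X_test]
accuracy = sum(p == t for p, t in zip(predictions, y_test)) / len(y_test)
print(f"held-out accuracy on synthetic data: {accuracy:.2%}")
```

The synthetic classes are deliberately easy to separate, so the printed accuracy says nothing about real PPI data. In practice a maintained random forest implementation would likely replace the toy forest, with real sequence-derived features in place of `make_synthetic`.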
Similar resources
The effect of sericin levels (silk glue protein) on the rate of in vitro maturation, fertilization and culture of sheep oocytes
The aim of the first experiment was to investigate the effect of different levels of sericin [0 (control), 0.1, 0.5, 1.0, 2.5 %] added to the IVM medium on cumulus cell expansion, nuclear maturation, and subsequent embryo development in Sanjabi sheep during the breeding season. Resumption of meiosis was measured by extrusion of the first polar body, and the percentage of two-cell embryos reaching the cleavage and blastocyst stages was also used as an indicator of early developmental competence ...
Prediction of protein-protein interactions using random decision forest framework
MOTIVATION: Protein interactions are of biological interest because they orchestrate a number of cellular processes such as metabolic pathways and immunological recognition. Domains are the building blocks of proteins; therefore, proteins are assumed to interact as a result of their interacting domains. Many domain-based models for protein interaction prediction have been developed, and prelimin...
Sequence-based prediction of RNA-protein interactions
CHAPTER 1. OVERVIEW: 1.1 Dissertation Organization; 1.2 Experimental Methods to Identify RNA-Protein Interactions; 1.3 Computational Prediction of RNA-Protein Interfaces ...
Seeing the trees through the forest: sequence-based homo- and heteromeric protein-protein interaction sites prediction using random forest
Motivation: Genome sequencing is producing an ever-increasing amount of associated protein sequences. Few of these sequences have experimentally validated annotations, however, and computational predictions are becoming increasingly successful in producing such annotations. One key challenge remains the prediction of the amino acids in a given protein sequence that are involved in protein-protei...
Detecting Protein-Protein Interactions with a Novel Matrix-Based Protein Sequence Representation and Support Vector Machines
Proteins and their interactions lie at the heart of most underlying biological processes. Consequently, correct detection of protein-protein interactions (PPIs) is of fundamental importance to understand the molecular mechanisms in biological systems. Although the convenience brought by high-throughput experiment in technological advances makes it possible to detect a large amount of PPIs, the ...
Journal
Journal title: BIO Web of Conferences
Year: 2022
ISSN: 2273-1709, 2117-4458
DOI: https://doi.org/10.1051/bioconf/20225501017